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Abstract 

We show two Conductance-like theorems for mixing time of non-reversible non-lazy walks; the 
first holds for walks with small holding probability, while the second theorem holds even for 
walks with no holding probability. These are used to derive two canonical path theorems for 
such non-reversible non-lazy walks. As an application we show that a known bound for mixing 
time of walks on undirected Cayley graphs applies to all finite directed Cayley graphs. 
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1 Introduction 

The notion of conductance || has been key to studying the convergence rate of many random walks. 
There have been many improvements on this original concept: the method of average conductance 
showed better mixing bounds for lazy reversible walks if small sets have high conductance @, 
evolving sets were used to apply this to lazy non-reversible walks § , modified conductance extends 
this to non-lazy non-reversible walks f8|, and blocking conductance shows improved bounds for a 
lazy reversible walk when ergodic flow is not heavily concentrated on a few vertices ||. 

In this paper we show a mixing time bound which encompasses all of these concepts. This is 
done by using evolving sets to study general finite Markov chains in a fashion resembling blocking 
conductance. In particular, when measuring ergodic flow from a set we count flow into each vertex 
only up to some ratio r of its size; if the flow is well distributed among vertices then this will 
differ little from regular ergodic flow even for fairly small values of the ratio r. Two versions of 
our bound are shown, when there is a non-zero holding probability at every vertex or when there 
is not, although up to a constant factor the latter bound is stronger than the former. 

These new results are then used to show two canonical path theorems for general finite Markov 
chains. The first requires only a small non-zero holding probability at each vertex, while the latter 
requires none. This does away with the largely artificial assumption of strong aperiodicity or re- 
versibility required for most canonical path analysis, conditions which are undesirable as algorithms 
are almost never implemented with a large holding probability and as a non-reversible walk may 
converge faster than a reversible walk. Although not directly comparable to past bounds, our result 
says roughly that the Poincare approach to bounding mixing times |2|, [3|, applies in the general 
setting if the maximum path length is replaced by a notion of maximum vertex congestion. Average 



path length and average vertex-congestion are equal (see Remark 4.6), and although it is usually 



easier to keep path length small than to smooth the paths among all the vertices, when looking at a 
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non-reversible walk it may be worth the extra effort to attempt this so that the artificial assumption 
of a large holding probability can be dropped. 

As an application of sorts, we show that known complexity results for the (non-lazy) simple 
random walk on an undirected graph with self-loops apply just as well to the simple random walk 
on an Eulerian graph with self-loops (i.e. strongly connected with in-degree=out-degree at each 
vertex), and often even without self- loops. Other canonical path methods result in the loss of a 
factor of max-degree d. A more interesting problem is to study random walks on Cayley graphs 
(i.e. on groups), for which we show that known bounds El |2[ for a lazy walk or a walk with a 
symmetric set of generators can be extended to the non-lazy non-symmetric case. 

Our paper proceeds as follows. We review past methods in Section ||. In Section || we show our 
conductance-like mixing results when ergodic flow is spread over many vertices. Then in Section |] 
canonical paths are used to bound mixing time in terms of notions of edge and vertex-congestion. 
We finish with a few examples in Section [|. 



2 Preliminaries 

Throughout the paper we consider a finite ergodic Markov kernel P (i.e. transition probability 
matrix) on state space V with stationary distribution tt. The walk has minimal holding probability 
a if \/v € V : P(v,v) > a, is called lazy if we may take a > 1/2, and is reversible if P* = P 
where the time-reversal P*(cc, y) = The ergodic flow from A C V to B C V is Q(A, B) = 

YlxeA yeB ^(x)P(x, y). The chi-square distance between distributions a and it is 

TT 2,n ^ \ir(v) J 

The I? mixing time T x (e) = min{n | \\k* — 1\\z,tt < £ } f° r density k*(y) = is the time for 

chi-square distance to drop to e 2 when started at state x £ V. 

In order to understand what is new about our results it is necessary to review earlier work on 
conductance and canonical path methods. 



2.1 Conductance methods 

Jerrum and Sinclair j|] used an argument involving spectral gap to show that mixing time of 
a lazy reversible walk can be bounded in terms of the conductance $ = min^4 C y &(A) where 
<&(A) = min^^'^^c)} • The method of spectral profile || can be used to extend this to consider 
set sizes for even non-reversible non-lazy walks, with 

f4A 



/ 2ds 



where the conductance profile is given by <fr(s) = min^^wo^ $(A) if set size s £ (0,1/2], and 
$(s) = $(1/2) if s > 1/2. When there is no holding probability then a few approaches are possible 
(see 0] and Section 4.4 of @), with 

4 / e2 Ads , _ f 4 /' 2 2ds 



f 1 Ads f 
r x (e)< / —2 — and r x {e) < / 

J4n(x) S<Ppp*(S) Jin(x) S(j) z (s 
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where 3>pp*(s) is conductance profile of PP*, and the modified conductance profile is 
6(s) = inf 4>{A) where 4>(A) = , — --— -r 

and 

(A) = min Q(A, 5) + ~ Q(A, v) . 

BcV,veV, Tt(v) 
n(B)<n(A c )<n(BUv) 

This is the ergodic flow from A to the worst set of size tt(A c ), in contrast to Q(A,A C ) which is the 
ergodic flow to a specific set of size ir(A c ). In Lemma 4.19 of [||] it is shown that <f>(A) > ^<l?pp*(A), 
so up to a small constant modified conductance is the better method. 

2.2 Canonical Paths 

Before discussing canonical paths results we require some notation: 

Definition 2.1. For every x, y S V, x ^ y, define a path j xy from x to y along edges of P, and let 
r be the collection of these paths. The edge- congestion is given by 

Pe = Pp (T) = max - -— — — ir(x)ir(y) 

The maximum path length is £ = max 7isg r \lxy\- 

Canonical paths are used to bound mixing time by considering edge-congestion alone ||, or 
through a Poincare inequality involving maximum path length as well [||, 10, 0. Combining these 



results in a bound for general Markov chains with non-trivial holding probability a: 

r x (e) < min{4p e , £} log j_ (2.1) 

The Poincare case can be extended to non-lazy reversible walks 0, if V contains only paths 
of odd length and also has a path from each vertex x to itself: 

r x (e) <2p e (r')l\og—L== (2.2) 
ey'TT(x) 

This suggests that for reversible walks if a is small then the Poincare bound is often too pessimistic. 
We will see a similar phenomenon is true even for non-reversible walks. 

When a general finite Markov has little or no holding probability then the — term will blow 
up. Then canonical paths can be used to bound the spectral gap App* of PP* by taking canonical 
paths along edges of walk PP*, i.e. paths of even length with edges drawn alternately from P and 
then P*. This leads to a bound of 

r x (e) <2p e pp *min{p e pp ^l°g^_ (2 3) 

However, the minimal transition probability of PP* is usually the square of that of P (e.g. usually 
minPP*(x,y) = minP(x,y) 2 if ir is uniform), making p PP * quite large. 
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3 Mixing bounds with r-conductance 



Our extensions of past conductance results will be in terms of a measure which counts ergodic flow 
only up to some ratio r of the vertex size. When ergodic flow is well distributed among vertices 
then it may be possible to make r quite small without decreasing the ergodic flow significantly, and 
so the choice of r will measure some form of vertex congestion. 

Definition 3.1. If A, B C V and r G [0, 1/2] then the r-ergodic flow from A to B is 

Q r (A£) = X>) min{^,/}. 

The r-ergodic flow of A is Q r (A) = min{Q r (A, A c ), Q r (^4 c ,yl)}. The r-conductance profile is 
4> r (s)= inf $ r (A) where $ r (A) = Q ^ A \ . 

if s € (0, 1/2], while if s > 1/2 then $ r (s) = $ r (l/2). 

When r > 1 - a then Q r (A) = Q(v4,v4 c ), and so for a lazy walk Qi/ 2 (^) = Q(A,A C ) and 
^1/2(^4) = ^(^4) agree with standard notions of ergodic flow and conductance. More generally, if 
ergodic flow from A to A c and from A c to A are not unduly concentrated on a few vertices then 
& r (A) f» &{A) even for fairly small r. In such a situation our first result can lead to a substantially 
improved mixing bound: 

Theorem 3.2. For a finite ergodic Markov chain if r < a then 

^A 2 4min{r,l - r} ds' 



r x (e) < 



4tt(x) S$ r (s) 2 



This can be a factor 1/ar times better than the Spectral profile result, although that used 
normal conductance instead of r-conductance. As will be discussed later, a natural application is 
a canonical path argument in which the paths are well distributed over the vertices. 

Our second result is similar to the first, but for walks with no holding probability. As such we 
require an expression resembling modified conductance, but also having an r cutoff. 

Definition 3.3. If A C V and r G [0, 1] the r-modified ergodic flow is 

r (A) = Q r (A, V) - rvr(A) = V vr(y) min { r) - rir(A) . 

The r-modified conductance profile is given by 

4> (s) = mm (A) where (A) = ttt^ , 

7r(A)e(0,s] vr^jTr^/L ) 

if a € (0, 1/2], while if s > 1/2 then r (s) = r (l/2). 
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To gain some intuition into this definition, note that if ir(A r ) > n(A) (see Definition 3.5) then 
$? r (A) is the minimal r-ergodic flow from A into a set of size tt(A c ). In particular, if r < a then 
^r(A) = Q r (A,A°). More generally, ^ r (A) < Q r (A,A c ) always. 

This time we can show a mixing time bound requiring no holding probability: 

Theorem 3.4. For a finite ergodic Markov chain if r G (0, 1] then 

f4/e 2 



r x {e) < 



12r ds 



( x ) smin{^ r (s) 2 ,r<^ r (s)} 



Itt 



This improves on Theorem |3.2| up to a constant factor because the bound is decreasing in 4> r (s), 
and so if r < min{a, 1/2} then </> r (s) = <Ev(s) < 2r. The case of 1/2 < r < a reduces to the case of 
1 - r, since min{r, 1 - r} > 1 - a and so Qi_ r (A) = Q(A, A c ) = Q r {A). 

The unusual minimum in the denominator is necessary. With only the 4> r (s) 2 term the lazy 
simple random walk on a complete graph K n with loops (i.e. P(i,j) = ^- if i ^ j and P(i, i) = 
\ in) a ^ r = min{s/2,l/2} (the proof allows r to be increasing in s) would give the incorrect 
t x (1/2) = O(l), whereas with only the r<f) r {s) term the simple random walk P(i,i ± 1) = 1/2 on a 
cycle Z n for n odd at r = 1/2 would give the incorrect r x (l/2) = 0(n). 

The proofs of both new bounds on mixing times are shown by use of the Evolving set method- 
ology H]. To understand the method requires some new terminology: 

Definition 3.5. Given AcV and u G [0, 1] let 

A u = {yeV | Q(A,y)>wr(y)} = {yeV | P*(y,A)>u}. 
The root profile ip : [0, 1] — >• [0, 1] is any function satisfying 

J^TT(A u )(l-7T(A u ))du 



^7t(A)(1-7t(A)) 



ifi(s) < min ij)(A) where ip(A) = 

tt(A)<s 

when s G (0, 1/2] and ^(s) < ^(1/2) when s > 1/2. 

Since Q(A,v) = tt(v) then the set A u indicates the set of vertices receiving at u-fraction of 
their inward flow from A. If tt(A u ) differs significantly from tt(A) when u G uar [0, 1] then this 
indicates the random walk expands quickly from set A into the full space. Since Jq it(A u ) du = 

T.yev^iy)^^ 1 = Q(AK) = n(A) and f(a) = y/a(l - a) is concave then f(i:(A u )) du < 
f(fo ^(Au) du) = f(ir(A)), and so large root profile indicates that ir(A u ) differs significantly from 
tt(A) if u G uar [0,1], i.e. the random walk diffuses quickly. Morris and Peres || related ij)(A) = 



to mixing time, although we use a sharper form of M (found after equation (4.5)): 



Theorem 3.6. For a finite ergodic Markov chain 

r-4/e 2 



Tx(e) < < 



ds 



4tt(x) Slj}(s) 

^ 2 ds 



in general 



if si) 



is convex 
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Evaluating i/j(A) is primarily an optimization problem subject to whatever assumptions are 
given in the problem to be solved. It is often useful to recall that J n(A u ) du = tt(A) and note 
that n(A u ) is a decreasing function of u. 

To show Theorem |3.2| it then suffices to lower bound ip(A) when r < a. Actually, when 
1/2 < r < a then min{r, 1 — r} > 1 — a and so Q\- r (A) = Q(A,A C ) = Q r (A). Hence it suffices to 
consider the case when r < min{a, 1/2}. 

Lemma 3.7. If A C V and r < minjo:, 1/2} then 



4>(A) > 2r ^1 - y/l- § r {A) 2 /Ar 2 ^j > 



4r 



Proof. First consider the conditions of the lemma. If v E A then Q(A,v) > Q(v,v) > an(v), and 
so if u < r < a then A u = A U {y G A c \ Q(A, y) > un{y)}. Hence 



/ 

Jo 



(\(A u )du = rn(A)+ ^ ^(y) min |^^, r| 



nr(A) + Q r (A, A c ) > rir(A) + Q r (,4) . 



Likewise, L n{A u ) du < rir(A) — Q r (A). 

Now for the optimization. Let x = Jq(tt(A u ) — ir(A))du and y = fi_ r (ir(A) — ir(A u ))du. 

Note that x,y > Q r (A), while J ' (ir(A u ) — ir(A))du = y — x because ir(A u ) du = tt(A). By 
Jensen's Inequality J f(g)dfj, < f(J gdfi) if / is concave and fj, is a probability measure, and so if 
f(a) = a/o(1 — a) then 



f 1 r du f 1 du r 1 ^ du 

/ f(n(A u ))du = r f(n(A u ))- + r f( 7r (A u ))- + (l-2r) / f{ir{A u )) — 

JO JO r Jl-r r Jr 1 - 2r 



< r /(tt(A) + x/r) + r f{v{A) - y/r) + (1 - 2r) / (jr(A) + ^— |- j = : h ( x > V) 

-f'(ir(A) - {£) + f'(7r(A) + < °- Likewise, 



By concavity /' is non-increasing, and so §jj 

fH y <x = /'( 7r ( j4 ) + f)-/ , ^( yl ) + S) Hence h is maximized when y = x. But £h(x,x 
f'(n(A) + f ) - f'(n(A) - f ) < 0, and so £ /(tt(A u )) < y) < h(Q r (A), Q r (A)). 
Putting together these various facts we have that 

1 - m < 1 - 2r + r /(^) + Qr (^)/r) + /(vr(A) - Q r (A)/r) 



2 2rvr(A)yV 2 2rvr(A c )y V \2 2rir(A) J \2 2m{A c ) 



< 1 - 2r + 2ryi - § r {A) 2 /Ar 2 < 1 - $ r (Ay/4r 

by Lemma ^ with X = \ + J^Jl and F = | - , and lastly ^/T^x < 1 - x/2. □ 

The following small inequality was required in the proof. 
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Lemma 3.8. If X, Y € [0,1] then 

g(X,Y) = ^XY + v / (l-X)(l-Y) < y/l — (X — Y) 2 . 

Proof. Observe that 

g(X,Y) 2 = 1 - (X + Y) + 2XY + - (X + Y) + 2XY] 2 - [1 - 2(X + Y) + (X + Y) 2 ] . 

Now, VA 2 - B < A - B if A 2 > B, A < and A > B (square both sides to show this). 
These conditions are easily verified with A = 1 - {X + Y) + 2 X Y and B = 1 - 2(X + Y) + (X + Y ) 2 , 
and so 

g(X,Y) 2 < 2[1-(X + Y) + 2XY]-[l-2(X + Y) + (X + Y) 2 } 
= 1 + 2XY - X 2 -Y 2 = 1- (X -Y) 2 

□ 



Finally, the following lemma suffices to prove Theorem |3.4| : 
Lemma 3.9. If A C V and r G (0, 1] then 

4,(A) > ^ min {fr (A) 2 , rf (A)} 

Proof. It suffices to assume that r E (0, 1) because the case of r = 1 follows by taking a limit. 

Observe that (A u ) c = (A c )\_ u for every u such that W G V : Q(A u) 7^ uir(v), and in particular 
■u-a.e., so = ^(^4 C ). Moreover <^ r (A) = r (^4 c ). The lemma then holds for set A if and only if 
it holds for A c . Thus, without loss we assume that ^ r {A) > ~& r (A c ), since if ^ r (A c ) > V r (A) then 
* r (J3) > y r (B c ) for B = A c . 

As before, the quantities in the lemma can be rewritten in terms of A u : 

I* tt(A u ) d U =Y] vr(y) min (^^, r) = rir{A) + tf r (A) 

io ^ I *(v) J 

Since 7r(^ u ) = it(A) then J^ 1 7r(A u ) du = (1 - r)7r(A) - * r (A). 
Consider the case when (/> r (A) = ^rlr^x^ < 1 — 2r. Then 



1 - ip(A) 



du_ 
r 



y/n(A)(l-*(A)) 



< r 



1 | ^(A)(1-tt(A)) \ / <p(A)n(A) 



+(l-r) 



^(A)(1-tt(A)) \ / | ^(A)tt(A) 
1 — r / \ 1 — r 



< J 1 + ^-l^/^)!,£M}V(i- r )fi m 



2r 12 I r 2 ' r [ I v M 2(1 - r) 
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The first inequality is Jensen's with f(x) = y/x(l — x) and some simplification, the second is 
because if 4> r {A) < 1 — 2r then the equation is concave as a function of x = tt(A) £ [0, 1/2] with 
negative derivative at x = and hence maximum at x = 0, while the final line applies the relations 
\/l + x < 1 + § — mm ^2 and VI — x <1 — fifx>0. This simplifies to give the lemma. 

When <jf{A) > 1 — 2r then the maximum in the second inequality above is instead at x = 

7r(i)= i-(i-y(i) )andso 



l-^i(A) < VI - <t> r {A) 2 < 1 

If r > 1/3 then > £g£ while if r < 1/3 then ^ > (IzHEl^ > i n eit her case 

the above equation establishes the lemma. □ 



4 New Canonical path bounds 

When ergodic flow leaving a set is not heavily concentrated at a few vertices then our r-conductance 
can improve substantially over the conductance method of ||. In this section we explore such a 
situation by considering the case when canonical paths are well distributed among the vertices, 
and thus the ergodic flow induced by canonical paths is not heavily concentrated at any vertex. A 
notion of vertex congestion will thus be required: 

Definition 4.1. Given paths V the vertex congestion is 

/ \ 

vr(x)vr(y) + - ^ vr(x)vr(y) 



pv = Pv(T) = max- 



vev ir(v) 



\v£-y X y\-yy X ve-y xy n-y yx J 



The minimum boundary probability is 

P (r) = min{P(a, b) \ 3x, y : (a, b) £ ^ xy ] . 

This form was chosen to ensure that a vertex v is only counted in one of the paths j xy or j yx , 
and so undirected paths are counted only once. To simplify this note that v € 7„ x D j X v for all 
x £ V, and so 

p v < max — - V ir(x)ir(y) . (4.4) 
vev ir(v) ^-^ 

Jx V Bvjtx 

Also, in an undirected graph if ^ yx is just j xy traversed in reverse then 

Pv = \ ma x-7-T V] vr(x)7r(y) . (4.5) 

It will be seen that Q pv / Pe > 1/Pe, and from this we derive our first path bound: 
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Theorem 4.2. For a finite ergodic Markov chain, if = mmir(v) and e < y/2 then 

T x (e) < Ap v maxj — , p e \ log — -L 
L a > e v / 7r( 



r(x) 



— — : — + 4p,, max ^ — , p e flog 

mm{a, P (r)j la J e^op,, 



Remark 4.3. Since p„ < p e (l - a) and p e < tfterc max {^,p e } < ^ p e , min{a % (r)} - 

The maximum and TTop v terms are not simply artifacts of the proof. A simple illustration 
of this is the walk on a cycle Z n with P(i,i) = a G (0,1) and P(i,i + 1) = 1 — a. With the 
obvious choice of paths this has p v = p e = 2(1 -a) ' ^o(r) = 1 — a, 7To = 1/n and mixing time 

r x ( e ) = 6 f r log ±) . 

When there is no holding probability then we can show a bound which combines aspects of both 
the non-lazy canonical path methods discussed in the preliminaries, in that we require paths of odd 
length with edges drawn alternately from P and then P*. A slightly different vertex congestion is 
required in this case: 

Definition 4.4. For each x, y G V (including x = y) define an odd length path j xy from x to y 
alternating between edges of P and P*, 

p p* p p* p 

X = X — * Xl > X2 — * X3 ►•■■—»■ X 2n +1 = V ■ 

P P* 

Denote (u,v) Gp j X y if u — * v or v — ► u appear in path j xy , i.e. the directed edge (u,v) of P 
was used in constructing the path (as edges of P* are reversals of edges of P). Also, v £ Q dd j X y if 
some (u,v) Gp j xy , i.e. v = X21+1 for some i G [0, n\. 

Let T be the collection of these paths. The vertex- congestion is 

p v = p„(f) = max — !— V 7r(x)vr(y). 
vev TT(V) , . , 

{x,y\v£oddlxy} 

The minimum reversed boundary probability is 

PS(f) = min{P*(&,a) | 3x, y : (a, b) G P j xy } . 

We will find that </> p o(^) > P °j r ^ ; from which we derive our general result: 
Theorem 4.5. For a finite ergodic Markov chain, if e < 1 then 

40p2 , 1 



r x (e) < 



log- 



P5(r) e\/^op7 
w/iere <5 = mm AiBcV {\n(A) - tt(B)| | tt(A) / tt(B)}. 

The requirement of odd length paths is necessary because a simple random walk on a cycle 
with an even number of vertices does not converge, while paths must alternate between P and P* 
because a clockwise walk on a cycle of odd length (i.e. P(i,i + !) = !) does not converge. 



9 



This is not directly comparable to Theorem 4.2. However, given paths T along edges of P one 
can define paths ^ xy £ T by taking the P steps to be self-loops and the P* steps to be the reverse 
of path ^ yx (define r y xx = so j xx is a self- loop). Then 



1 + max . . tt(x)tt(ii) 



Pv 



vev ir(v) 



while Pg(T) = min{a, Po(r)}, and so Theorem 4.5 resembles the second bound of Theorem 4.2 
under the simplifications suggested in and Remark [4.3| . 

Remark 4.6. It may seem odd that for a lazy walk there are three forms of the mixing bound: the 
original conductance bound involves p e p e , the Poincare bound uses p e £, while our bound has p e p v . 
Certainly p v < p e so our bound strictly improves on the original conductance result. Also, if we 
define average vertex congestion and average path length by 



pT 



1 



ir(v) 



Y vr(x)vr(y) 



and 



T. x ^ y& y^{ x )^{y)\lxy\ 



respectively, then 



pT 



Y £ n(x)Tr(y) 

x^yeV -f xy 3v^x 

iave Y 7r ( x ) 7r (y) 



Y 7r ( X ) 7r (y) l7ay I 
x^y£V 



7T 



xj^yeV 



Hence vertex- congestion and path length are of similar sizes when the canonical paths are well 
distributed among the vertices. When a few paths are very long, or the distribution is concentrated 
at a single vertex, then vertex- congestion can be much smaller than maximum path length, although 
it is more often the case for maximum path length to be smaller than vertex-congestion. 



To prove Theorem |4.2j we require a lower bound on an appropriate choice of r-conductance. 
Lemma 4.7. Given cycle-free paths {^xy} between each x,y S V, x ^ y, then 

® Pv/pe {A)>l/ P e. 

Proof. For each pair of vertices x £ A, y £ A c , x ^ y, transport flow of Tr(x)ir(y) along the path 
■y xy from x to y, for a total of tt(A)tt(A c ) from A to A c . Then, if b G A c , 



tt(6) > 



Pv 



Y ^OM?/) = — XI Yl K(x)ir(y). 



(x,y)£AxA c 

b£~/xy 



aeA (x,y)eAxA c 
(a,b)ejx V 



Likewise, if a E A and b £ A c then 



7r(o)P(o,&)>- Y < X XV) 



(x,y)€AxA c 
(a, b)£~ix y 
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Hence, 



Q Pv/Pe (A,A c ) = min(j>(a)P(a,&), ^ tt(6) 



> Y Y ~ Y 7r ( x ) 7r (y) 



Pe 

beA c aeA r (x,y)£AxA c 
(a,b)ey xy 

Pe 



— Y Tr(x)Tr(y) \{(a,b) £ Ax A c \ {a,b) e-f xy }\ 
Pe {x,y)eAxA c 
1 , . . , it(A)-k(A c ) 



Pe (x,y)£AxA c Pe 

A similar argument lower bounds Q Pv / Pe (A c , A). □ 
It is then not hard to show our first canonical path result: 



Proof of Theorem \4-3j - Remove any cycles in the paths. This can only decrease p v and p e , and 



hence improve the mixing time bound. If p v /p e < a then i(j(A) > l/4p v p e by Lemmas and 4.7. 
If, instead, p v /p e > a then at least $ a (A) > - °j p & Pv / Pe (A) > a/p v , and so ip(A) > a/Ap 2 ,. Hence 



4p v max{p v /a, p e } 



The convexity condition of Theorem 3.6 is satisfied by this lower bound, and so the first mixing 
time bound follows by a simple integration. 

For the second mixing time bound we improve bounds on r-ergodic flow from small sets. Since 
Q Pv / Pe (A, A c ) > was shown using only edges in paths of T, there exists (a,b),(x,y) S A x A c 
with (a, b) 6 ^ xy , and so Q(A,b) > Q(a,b) > 7ToPo(r). Then Q Po ( r )(^4) > 7ToPo(r), and similarly 

Qp (r)(^4 c ) > TToPo(r)- ^ follows that 5>p (r) > 7t^a^(A c ) ' anc ^ as m * ne P rev i° us paragraph then 

(T> r , , > 7tq min{q,P (r)} R t lq-71 
*min{Q,Po(r)} > 7r(A)7r(A c ) " ^ LenUna B 



V>(^4) > max 



1 min{a,P (r)}vr2 



4p v max{ Pv /a, Pe } ' 4tt(A) 2 tt(A c ) 2 
The convexity condition of Theorem |3.6| is easily verified for this lower bound and so 

r°P- 2s ds r Pv 1 f 1 ^ 2 ds 

T x {e) < / — — - — 2 + 2^max^ — , p e \ \ — 

J no mmja, P (r)}vr^ la > J mpv s 

if ttqPv < 1/e 2 , which integrates to give the second bound in the theorem. Since irop v < = \ 

then it suffices that e < \f 7 l. □ 

For the canonical path theorem without holding probabilities we require a lower bound on an 
appropriate choice of r-modified conductance. 
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Lemma 4.8. If A is a proper subset of V then 



ran 



04) > 



Proof. Define -E = {(cc, y) | 3a, 6 : (x, y) Gp ■j ao } to be the edge set used in paths. Since v £ dd ixv 
then every vertex is an endpoint in E and we may partition the state space into vertices with 
incoming edges in E from A, A c , or both: B = {v £ V \ (x, v) £ E =>- x G ^4}, C = {v £ V \ 
(x,v) £ E ^ x £ A c }, and D = V \ (B U C) = {v £ V \ 3x £ A, y £ A c ; (x, v) £ E; (y, v) £ E}. 
Then 

* P5( f)(^) > Q PS( f)(^, BUD)- P5(f>(A) = P* (t)(ir(B U D) - tv(A)) . 

Likewise, ^ p , (t) (A c ) > P*(f )(tt(C Ufl)- vr(A c )) = P*(f)(7r(A) - tt(5)). 

We now lower bound 7t(D). If x £ A and y £ V \ B then on path 7^ if X2i+i € £? then 
%2i+2 S -4, so the first vertex X2«+i ^ 5 will be in D, and in particular \{a £ D : a £ dd ixy}\ > 1- 
But then 



tt(d) = £>( a )>^ E 

aeD aeD (x,y)eAx(V\B) 

a &oddixy 



7r(x)7r(y) 



> 



(a;,i/)eAx(V\B) 

^(A)vr(y \B 



7r(x)7r(y) 



Pv 



\{a £ D : a £ odd % y }\ 



If ir(B) < tt(A) 

n(A)n{A c ) 



n(A)n(A c ) 
2p v 



n(A) - n g } then ir(B U D 



then ^p 5( r)(A 
)-tt{A) > n(B) + 



> p o(r r (A 2^ (AC) - On the other hand, if tt(B) > 



7T(A)(l-7r(B)) 



7T(A) > 



7r(A)7r(A C ) 

2p„ 



It follows that 
□ 



Proof of Theorem \4-^ . Observe that p v > 1 because t> G dd 7«« f° r every u, v £ V. Then proceed 
as in the second bound of Theorem 4.2, but with <fi p o( r }(A) > 



-Pg(r) 



(A) 



> 



P5(r)<?o 



used when 



?r(A) — 7r(A) 

7t(t4) E (710, 25o/j„], and Lemmas 3^ and -L8 in place of Lemma s |3.7| and |4 7| . This gives a coefficient 
of 48. To reduce this to 40 observe that in the proof of Lemma |3^ we may use the stronger relation 
y/T+x < 1 + f - fj when x £ [0, 1/2] to show that 41(A) > when 4> r (A) < § 



□ 



5 Examples 

We present two examples where our canonical path theorems extend previously known bounds into 
the general non-reversible non-lazy case. First, the classical problem of the max-degree walk on a 
graph, then walks on Cayley graphs. 

Example 5.1. Consider an Eulerian multigraph with n vertices and max-degree d, that is a strongly 
connected graph with in-degree=out-degree at each vertex. This is the natural generalization 
of an undirected multigraph into the directed graph setting. Let d(x, y) denote the number of 
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directed edges from x to y, so that d(x) = ^2 y d(x,y) is the degree of x. 
has uniform stationary distribution and transition probabilities P(x,y) 



P(x, x) = 1 



d(x,y)—d(x,x) _ 



The max-degree walk 

_ d(x,y) 
d 



if y 7^ x and 



; the walk is lazy if there are at least d/2 self- loops at each vertex. 



By strong connectivity, for every x ^ y £ V there is a shortest path ^y xy with \ j X y\ < n — 1. 
In order to eliminate significant periodicity effects it is sufficient to assume that every vertex has 
a single self-loop, i.e. holding probability a > 1/d. Then, even if every path passes through some 
vertex v and edge e, then p v < 2 ^ x ) = § (half size because v E j X y ^lyx) and p e < j^, and so by 



Theorem 4.2 



T x (e) < 



dn 



2 log- 



In contrast, the Poincare method (|2.l| ) can show a comparable bound only for the lazy walk, while 
the odd path length ( |2.2j ) only for the reversible case (i.e. d(x,y) = d(y,x) for every x,y). If ( [2.1D 
or (^^) were used to study the Eulerian case then a factor of d would be lost in either case. 

More generally, without the self- loop condition, if the graph is connected under paths j xy of 
odd length alternating between edges of E = {(x,y) : d(x,y) > 1} and E* = {(x,y) : d(y,x) > 1} 
then p v < = n and Pg > 1/d. By Theorem |4.5| , 



r*(e) < 



40 dn 1 log - 

e 



In comparison, ( p.2j ) can show a comparable bound only in the reversible case, while if ( |2 . 3| ) is used 
to study this then at worst p^ p = Q(nd 2 ) and and so a factor of d is lost. 

Our methods have thus shown that canonical path methods need not lose a factor of d when 
going from the undirected to the Eulerian directed case, both with the self-loop requirement and 
without. 

A more interesting problem is walks on Cayley graphs, that is, random walks on groups. 

Example 5.2. Consider a group G with (non-symmetric) generating set S. The Cayley graph has 
edge set (g, gs) for all g £ G, s G S. If p is a probability distribution on S then consider the walk 



P(flS 5 s ) = P( s )- Babai M showed a mixing bound of r x (e) = 0(- 



log(|G|/e)) for the lazy 



> min sS sp(s) 

walk on Cayley graphs (i.e. p(id) > 1/2) with symmetric generating set and diameter A. Diaconis 
and Saloff-Coste used canonical paths to extend this to the non-symmetric lazy case, and to the 
symmetric case if merely id S S. Our path methods show there is no need to differentiate between 
symmetric and non-symmetric cases, that is, the same bound holds for the non-symmetric case 
with id E S; we show a similar bound holds even if id ^ S. 

First the case of id E S, so there is a holding probability. To each g E G write g = si S2 • ■ ■ Sfc 
as a product of generators. Let A = max|<7| be the length of the longest such representation, and 
N(g, s) < A denote the number of times generator s appears in the representation of g. Then the 
argument of [0] is easily extended (see Appendix) to show that there are canonical paths such that 



Pv < A, 



p e < max 

gGG,s£S-id 



N(g,s) 
p(s) 



It follows from Theorem 4.2 that 



r x (e) < 



4A 2 / 1 2IGI , 1 
— pr - log J-I + log - 
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This is (up to a small constant) the same bound as Babai, and Diaconis and Saloff-Coste showed 
in weaker settings. If ( |2.l|) were used to extend their bounds to this more general setting it would 
have had the weaker p(id) mm se s-idp(s) < (mm s ^s p(s)) 2 in the denominator. 

Now suppose the identity is not in S. Then, if the graph has diameter A* where diameter is 
measured along odd length paths alternating between elements of S and S* -1 , the argument in the 
Appendix applies to p v as well, but with vertices at odd positions of path ^ xy , and without 

1 — ir(g) being subtracted. Hence p v < 1+ 2 A and so 

. '10(1 + A*) 2 (\ , 2\G\ , 1 



/ n i log -i , a i + log ■ 
mm sg 5p(sj \2 1 + A* e 

In comparison, ( |2.2j ) can show this only with symmetric generating set, while if fl2.3j ) is used to study 
the general setting then it results in mm ss > e s S* p{s)p* (s r ) = (min se s p(s)) 2 in the denominator. 

Our methods have thus shown the squaring of min p(s) to be unnecessary when going from the 
symmetric to non-symmetric setting, both in the case of small holding probability and the case of 
no holding probability. 
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Appendix 

In order to study walks on Cayley graphs we claimed that a result of || generalizes easily. Details 
can be found below. 

Lemma 5.3. Consider group G with (non- symmetric) generating set S, and to each g £ G write 
g = s\ S2 ■ ■ ■ Sk as a product of generators. Let A = max \g\ be the length of the longest such 
representation, and N(g, s) denote the number of times generator s appears in the representation 
of g. If p is a probability distribution on S then the random walk with transitions P(g,gs) = p(s) 
has canonical paths such that 

N(g,s) 

p v < A, p e < max — -— . 

geG,seS p(s) 

If the generating set is symmetric then p v < =i^- . 

Proof. Given x, y £ G let g = x~ l y = s\ S2 • • • Sfc and path j XjV be given by x — ► xs\ xg = 

y- 

First vertex-congestion. The same number of paths pass through each vertex, because if 7^ „ 
includes vertex v then 'J X v- 1 w,yv- 1 w includes vertex w, and vice-versa. Then, by equation (|4,4j) , 

/ \ 

p» - JnYl ^Tn\ ^ ir(iWy)-(i-ff(j)) 



\G\ 

1 1 gee 



7r(g) 

. x,yeG\xyty, , 



< E (l + W)T(xMy)-i^Li 

x,y£G I x=£y 



For the undirected case use the form of p v given in equation (|4.5| ). 

Now edge-congestion. For fixed s E S, the same number of paths pass through each edge (g, gs) 
V<? G G, because if 7^ includes edge (v, vs) then J xv -i w> y V -i w includes edge (w, ws), and vice- versa. 
Hence, 

pe = ^ , } E <*)<y) 

< max— E N(x~ l y, s)7r(x)7r(y) 
ses pis) *—f 

1K ' x,y&G\x^y 
1 

< max —— max N(q, s) . 
s&S p(s) g€G vy ' ' 

□ 
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